Output Set: N:\CRF3\02072002\I940166A.raw



SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Blank, Gregory S. 

Narindray, Daljit S. 
Zapata, Gerardo A. 
(ii) TITLE OF INVENTION: Protein Recovery 
(iii) NUMBER OF SEQUENCES: 7 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 
(B) STREET: 1 DNA Way
(C) CITY: South San Francisco
(D) STATE: California

(E) COUNTRY: USA 

(F) ZIP: 94080
(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/940,166A

(B) FILING DATE: 27-Aug-2001 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/097,309 

(B) FILING DATE: 13-JUN-1997 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Schwartz, Timothy R. 

(B) REGISTRATION NUMBER: 32171 

(C) REFERENCE/DOCKET NUMBER: P1105R1 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-7467 

(B) TELEFAX: 650/952-9881 
(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly

1 5 10 15

Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Tyr Thr Phe Thr
20 25 30

RAW SEQUENCE LISTING DATE: 02/07/2002

PATENT APPLICATION: US/09/940,166A TIME: 09:02:11

Input Set : N:\Crf3\RULE60\09940166A.txt

59 Glu Tyr Thr Met His Trp Met Arg Gln Ala Pro Gly Lys Gly Leu

60 35 40 45

62 Glu Trp Val Ala Gly Ile Asn Pro Lys Asn Gly Gly Thr Ser His

63 50 55 60

65 Asn Gln Arg Phe Met Asp Arg Phe Thr Ile Ser Val Asp Lys Ser

66 65 70 75 

68 Thr Ser Thr Ala Tyr Met Gln Met Asn Ser Leu Arg Ala Glu Asp

69 80 85 90 

71 Thr Ala Val Tyr Tyr Cys Ala Arg Trp Arg Gly Leu Asn Tyr Gly 

72 95 100 105 

74 Phe Asp Val Arg Tyr Phe Asp Val Trp Gly Gln Gly Thr Leu Val

75 110 115 120

77 Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu 

78 125 130 135 

80 Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly 

81 140 145 150

83 Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp 

84 155 160 165 

86 Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val 

87 170 175 180 

89 Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val

90 185 190 195 

92 Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn

93 200 205 210 

95 His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys 

96 215 220 225 

98 Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu 

99 230 235 240 

101 Leu 

102 241 

104 (2) INFORMATION FOR SEQ ID NO: 2:
106 (i) SEQUENCE CHARACTERISTICS:

107 (A) LENGTH: 214 amino acids 

108 (B) TYPE: Amino Acid 

109 (D) TOPOLOGY: Linear 

111 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

113 Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val

114 1 5 10 15

116 Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Ile Asn

117 20 25 30

119 Asn Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys

120 35 40 45

122 Leu Leu Ile Tyr Tyr Thr Ser Thr Leu His Ser Gly Val Pro Ser

123 50 55 60
125 Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr Ile
126 65 70 75

128 Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln

129 80 85 90
131 Gly Asn Thr Leu Pro Pro Thr Phe Gly Gln Gly Thr Lys Val Glu
[Listing lines 132-156 (position numbering for residues 91-105 and residues 106-214 of SEQ ID NO: 2) are not recoverable from this copy; only the fragments Ser, Ser, Thr, His survive.]

158 (2) INFORMATION FOR SEQ ID NO: 3:

160 (i) SEQUENCE CHARACTERISTICS: 

161 (A) LENGTH: 36 amino acids 

162 (B) TYPE: Amino Acid 

163 (D) TOPOLOGY: Linear 
165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

167 Leu Gly Gly Arg Met Lys Gln Leu Glu Asp Lys Val Glu Glu Leu

168 1 5 10 15
170 Leu Ser Lys Asn Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys
171 20 25 30 

173 Lys Leu Val Gly Glu Arg 

174 35 36 

176 (2) INFORMATION FOR SEQ ID NO: 4:

178 (i) SEQUENCE CHARACTERISTICS: 

179 (A) LENGTH: 7 amino acids 

180 (B) TYPE: Amino Acid 

181 (D) TOPOLOGY: Linear 
183 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

W--> 185 Leu Xaa Xaa Xaa Xaa Xaa Xaa 
186 1 5 7 

188 (2) INFORMATION FOR SEQ ID NO: 5:

190 (i) SEQUENCE CHARACTERISTICS: 

191 (A) LENGTH: 2143 base pairs 

192 (B) TYPE: Nucleic Acid 

193 (C) STRANDEDNESS: Single 

194 (D) TOPOLOGY: Linear
196 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

199 GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50 
201 TCATTGCTGA GTTGTTATTT AAGCTTTGGA GATTATCGTC ACTGCAATGC 100 
203 TTCGCAATAT GGCGCAAAAT GACCAACAGC GGTTGATTGA TCAGGTAGAG 150 
205 GGGGCGCTGT ACGAGGTAAA GCCCGATGCC AGCATTCCTG ACGACGATAC 200 
207 GGAGCTGCTG CGCGATTACG TAAAGAAGTT ATTGAAGCAT CCTCGTCAGT 250 
209 AAAAAGTTAA TCTTTTCAAC AGCTGTCATA AAGTTGTCAC GGCCGAGACT 300

211 TATAGTCGCT TTGTTTTTAT TTTTTAATGT ATTTGTAACT AGAATTCGAG 350

213 CTCGCCGGGG ATCCTCTAGA GGTTGAGGTG ATTTTATGAA AAAGAATATC 400 

215 GCATTTCTTC TTGCATCTAT GTTCGTTTTT TCTATTGCTA CAAACGCGTA 450

217 CGCTGATATC CAGATGACCC AGTCCCCGAG CTCCCTGTCC GCCTCTGTGG 500 

219 GCGATAGGGT CACCATCACC TGTCGTGCCA GTCAGGACAT CAACAATTAT 550 

221 CTGAACTGGT ATCAACAGAA ACCAGGAAAA GCTCCGAAAC TACTGATTTA 600 

223 CTATACCTCC ACCCTCCACT CTGGAGTCCC TTCTCGCTTC TCTGGTTCTG 650

225 GTTCTGGGAC GGATTACACT CTGACCATCA GCAGTCTGCA ACCGGAGGAC 700

227 TTCGCAACTT ATTACTGTCA GCAAGGTAAT ACTCTGCCGC CGACGTTCGG 750

229 ACAGGGCACG AAGGTGGAGA TCAAACGAAC TGTGGCTGCA CCATCTGTCT 800

231 TCATCTTCCC GCCATCTGAT GAGCAGTTGA AATCTGGAAC TGCCTCTGTT 850 

233 GTGTGCCTGC TGAATAACTT CTATCCCAGA GAGGCCAAAG TACAGTGGAA 900 

235 GGTGGATAAC GCCCTCCAAT CGGGTAACTC CCAGGAGAGT GTCACAGAGC 950 

237 AGGACAGCAA GGACAGCACC TACAGCCTCA GCAGCACCCT GACGCTGAGC 1000 

239 AAAGCAGACT ACGAGAAACA CAAAGTCTAC GCCTGCGAAG TCACCCATCA 1050 

241 GGGCCTGAGC TCGCCCGTCA CAAAGAGCTT CAACAGGGGA GAGTGTTAAG 1100 

243 CTGATCCTCT ACGCCGGACG CATCGTGGCG CTAGTACGCA AGTTCACGTA 1150 

245 AAAACGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 1200 

247 TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 1250 

249 AGGTTCAGCT GGTGGAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 1300

251 CTCCGTTTGT CCTGTGCAAC TTCTGGCTAC ACCTTTACCG AATACACTAT 1350 

253 GCACTGGATG CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGCAGGGA 1400 

255 TTAATCCTAA AAACGGTGGT ACCAGCCACA ACCAGAGGTT CATGGACCGT 1450 

257 TTCACTATAA GCGTAGATAA ATCCACCAGT ACAGCCTACA TGCAAATGAA 1500

259 CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTATTGTGCT AGATGGCGAG 1550 

261 GCCTGAACTA CGGCTTTGAC GTCCGTTATT TTGACGTCTG GGGTCAAGGA 1600 

263 ACCCTGGTCA CCGTCTCCTC GGCCTCCACC AAGGGCCCAT CGGTCTTCCC 1650 

265 CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGGCT 1700 

267 GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 1750 

269 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC 1800 

271 AGGACTCTAC TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG 1850 

273 GCACCCAGAC CTACATCTGC AACGTGAATC ACAAGCCCAG CAACACCAAG 1900 

275 GTCGACAAGA AAGTTGAGCC CAAATCTTGT GACAAAACTC ACACATGCCC 1950 

277 GCCGTGCCCA GCACCAGAAC TGCTGGGCGG CCGCATGAAA CAGCTAGAGG 2000 

279 ACAAGGTCGA AGAGCTACTC TCCAAGAACT ACCACCTAGA GAATGAAGTG 2050 

281 GCAAGACTCA AAAAGCTTGT CGGGGAGCGC TAAGCATGCG ACGGCCCTAG 2100 

283 AGTCCCTAAC GCTCGGTTGC CGCCGGGCGT TTTTTATTGT TAA 2143 

285 (2) INFORMATION FOR SEQ ID NO: 6:

287 (i) SEQUENCE CHARACTERISTICS: 

288 (A) LENGTH: 237 amino acids 

289 (B) TYPE: Amino Acid 

290 (D) TOPOLOGY: Linear

292 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

294 Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe

295 -23 -20 -15 -10 

297 Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser

298 -5 1 5

300 Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr
VERIFICATION SUMMARY DATE: 02/07/2002

PATENT APPLICATION: US/09/940,166A TIME: 09:02:12



Input Set : N:\Crf3\RULE60\09940166A.txt 
L:28 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:]
L:29 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]
L:185 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#: 4